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Abstract: Sequencing technology has developed rapidly in recent years. Complete or nearly complete 
mitochondrial genomes (mitogenomes) of 155 species from 47 families in Heteroptera have been sequenced. 
However, the amounts of mitogenomes between those families are unbalanced, which makes it difficult to 
correctly discern the patterns of mitogenome rearrangement in Heteroptera. Among 21 species from ten 
families, ten variations in mitogenome rearrangement had been previously reported, among which the 
translocation between tRNA-Thr and tRNA-Pro was considered as a synapomorphy of Pyrrhocoroidea based 
on two mitogenomes. As only one mitogenome in each of Largidae and Pyrrhocoridae had been sequenced to 
conclude the synapomorphy, more mitogenomes of Pyrrhocoroidea need to be explored. In this study, 
additional two mitogenomes of Pyrrhocoroidea (Macrocheraia grandis grandis (Gray, 1832) and Myrmoplasta 
mira Gerstacker, 1892) were sequenced. Both of them also possess the same translocation between tRNA-Thr 
and tRNA-Pro, which reaffirms that this kind of rearrangement is a molecular synapomorphy of 
Pyrrhocoroidea. Moreover, we discovered a more complex rearrangement in Myrmoplasta mira, in which six 
nearly identical duplications of tRNA-Thr were found located downstream of tRNA-Pro. Considering the high 
biodiversity of Heteroptera, more mitogenomic studies are needed to improve our knowledge about 
mitogenome rearrangements and the potential synapomorphies. 


Key words: Pyrrhocoroidea; mitogenome; synapomorphy; phylogenomics; evolutionary history 


EAI Myrmoplasta mira H RAAE A 2H Ze ZO ER SE AP BE CAA: FAEH: 
WKH) 

(JÆ 53, Petr KMENTŹ, Frantisek STAHLAVSKY?, HK 23, E4682 23, wg 23° 

1. PPK ee, KE 300071; 2. PLAYA BA Ra SU SS. AR PIN 510275; 
3. HLL ARE REESE GE AS SEL A, PAR TM) 510275; 4. Department of Entomology, National 


Accepted 20 March 2019. Published 25 June 2019. Published online 6 June 2019. 
(D Corresponding author, E-mail: xieq8@mail.sysu.edu.cn 


Entomotaxonomia (2019) 41(2): 96-113 97 


Museum, Cirkusova 1740, 19300 Prague 9, Czech Republic; 5. Department of Zoology, Charles University, 
Vinicna 7, CZ-128 44 Praha 2, Czech Republic 

WE: Akuna, CAM TAE H ER 47 P155 BP sc Me Bele oe ZR a PE 
KFZ. PRIM, Beefs Sie A) ZL AS) Ce ET HE BLS TRO A EAS a, by SS H Eo EUR 
AZAR. FECA NOFA, E 10 E 21 MEH AHS 10 KRAE RO EHEN X 
HHH tRNA-Thr 4 tRNA-Pro EOS SEGA AE ZUR BSE ATIE. (HAEDUT AEC ZT BP Se 
AE HEOEUS ASO TE SERAIS AE T SERA BE ST E AEREE A) LA) AOE FE HDFT 
JY Wi ZTE A uu A 28 FY] EAE (Gray, 1832) 441 Myrmoplasta mira Gerstücker, 1892). "t 
NIÆ tRNA-Thr I tRNA-Pro Z EFE ARTA, SCRAP AEE OA SE TRE. ES, 
TE Myrmoplasta mira PARTE NR ZEB EHEZ; SU: YE tRNA-Pro H FYFE 6 ML tRNA-Thr 
EN. FERAH Eis ERU AE VIA PEE, ARR rie ERE AGH SE AL er], DA si SEN AR 
Hi As ee A) E HERURCR GTAERTWR o 
üt: ZIM, AAS: SEA ATE: PAAR: Le 


Introduction 


Heteroptera is the largest suborder of Hemiptera, containing 123 families and more than 
45,200 described species (Henry 2017). Since the complete mitogenome of Drosophila yakuba 
(Diptera: Drosophilidae) was sequenced (Clary & Wolstenholme 1985), which was the first 
mitogenome sequenced in insects, complete or nearly complete mitogenomes of 155 species in 
47 families of Heteroptera have been sequenced. However, the amounts of known 
mitogenomes in different families vary widely. More than half of described mitogenomes are 
concentrated in the Reduviidae (including Phymatinae), Miridae, Tingidae, Nabidae, 
Pentatomidae and Aradidae, and only one or two mitogenomes have been sequenced in each of 
the other 30 families (Table S1), making their patterns of rearrangement hard to summarize. 
The gene order of a mitogenome is generally conserved in insects, similar to what was found 
for Drosophila yakuba. But rearrangements still exist across the 17 orders of Insecta (Wang et 
al. 2016; Chen & Du 2016). Ten kinds of rearrangements in ten families of Heteroptera had 
been previously reported (Table 1), in which tRNA-Ile and tRNA-Gin switch their positions in 
all seven mitogenomes of Aradidae. Li et al. (2016) proposed that it was likely a 
synapomorphy of Aradidae. 


Table S1. The number of sequenced mitogenome in each family of Heteroptera 


Infraorder Family The number of sequenced mitogenome 
Cimicomorpha Anthocoridae 2 

Cimicidae 1 

Miridae 13 

Nabidae 7 

Reduviidae 28 

Tingidae 13 

Velocipedidae 1 


Nepomorpha Aphelocheiridae 2 


98 MEN etal. Mitochondrial genomes of Macrocheraia grandis and Myrmoplasta mira 


Continued Table S1 


Infraorder Family The number of sequenced mitogenome 


Belostomatidae 3 
Corixidae 
Gelastocoridae 
Helotrephidae 
Micronectidae 
Naucoridae 
Nepidae 
Notonectidae 
Ochteridae 
Pleidae 
Pentatomomorpha Aradidae 
Alydidae 
Coreidae 
Rhopalidae 
Stenocephalidae 
Berytidae 
Colobathristidae 
Lygaeidae 
Malcidae 
Rhyparochromidae 
Acanthosomatidae 


Cydnidae 


WwW e N N N N e N e UU hl NRF RP WNP RNY RP FR 


Dinidoridae 


A 
P 


Pentatomidae 
Plataspidae 
Scutelleridae 
Tessaratomidae 
Urostylididae 
Largidae 
Pyrrhocoridae 
Gerromorpha Gerridae 
Hydrometridae 
Veliidae 
Leptopodomorpha Leptopodidae 
Saldidae 
Dipsocoromorpha Ceratocombidae 


Schizopteridae 


A N N U e ere Ne RP RP www rN 


Enicocephalomorpha Enicocephalidae 
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Table 1. Rearrangements in Heteroptera 


: . Rearrangements Other 
Family Species i : 
as potential synapomorphies rearrangements 
Urostylididae Urochela quadrinotata Absence of I and Q 
Largidae + Physopelta gutta T-P > P-T 
Pyrrhocoridae Dysdercus cingulatus (Hua et al. 2008) 
Aradidae Aneurus similis I-Q — Q-I 
Aneurus sublobatus (Li et al. 2016) 
Brachyrhynchus hsiaoi 
Libiocoris heissi 
Neuroctenus parus 
Aradacanthia heissi W-C ^ C-W 
Aradus compar M-ND2-W-C- Y — 
C-Y-M-ND2-W 
Nabidae Nabis apicalis Absence of I, Q and M 
Reduviidae Phymata americana Absence of I 
Brontostoma colossus ND3-A — ND3-R-A 
Reduvius tenebrosus I-Q — Q-I 
Enicocephalidae — Henschiella sp. PJ-2015 T-P-ND6-CytB-S-ND1-L-lrRN 
Oncylocotis sp. PJ-2015 A-V-srRNA-CR > 
Stenopirates sp. HL-2011 CytB-S-CR-IrRNA-V-srRNA- 
Stenopirates sp. PJ-2015 ND1-L-P-T-ND6 
Ceratocombidae ^ Ceratocombus sp. HL-2012 Translocation of R and E & 
Ceratocombus japonicus V-srRNA — srRNA-V 
Belostomatidae Lethocerus deyrollei Absence of I 


A similar opinion was proposed by Hua et al. (2008) while studying the two 
mitogenomes in Largidae and Pyrrhocoridae, which are the only two families of the 
superfamily Pyrrhocoroidea. Each contains one mitogenome with the translocation between 
tRNA-Thr and tRNA-Pro, which was considered as a synapomorphy of Pyrrhocoroidea. But it 
is still unknown whether this synapomorphy will be supported when more mitogenomes in 
Pyrrhocoroidea are revealed. To secure more mitogenome sequences and check the reliability 
of such a synapomorphy, mitogenomes of Macrocheraia grandis grandis (Gray, 1832) 
(Largidae: Physopeltinae) and Myrmoplasta mira Gerstacker, 1892 (Pyrrhocoridae) in 
Pyrrhocoroidea were sequenced and analyzed in this study. Macrocheraia grandis grandis is 
the species with the largest body size in Largidae, and Myrmoplasta mira is an ant-like 
Pyrrhocoridae. 


Material and Methods 


Samples and DNA extraction 

The specimen of Macrocheraia grandis grandis was collected by Shasha YU in China, 
Hainan, Nankai Township on July 24, 2013 and preserved in absolute ethyl alcohol at -20?C. 
The total genomic DNA was extracted from thorax tissue and the whole head using the 
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CTAB-based method (Reineke et al. 1998). The specimen of Myrmoplasta mira was collected 
by Frantisek STAHLAVSKY, Jan A NEETHLING, Vera OPATOVA and Pavel JUST in South 
Africa, Eastern Cape on October 15, 2013. This sample was preserved in absolute ethyl 
alcohol at -20°C. Total genomic DNA was extracted from a fore leg using the TIANamp Micro 
DNA Kit (TIANGEN). 


PCR amplification, sequencing and assembling 

The mitogenome of Myrmoplasta mira was amplified in several overlapping PCR 
fragments. The thermal cycling program was set as follows: an initial denaturation at 95?C for 
2 min, denaturation of 35 cycles at 95°C for 20 s, annealing at 45-62°C for 1 min, elongation 
at 72°C for 1-2 min, and a final elongation at 72°C for 10 min. PCR products were 
electrophoresed in 196 agarose gel and then sequenced by the Sanger method. The results were 
assembled by SeqMan in the DNAStar v7.1 program package. The Macrocheraia grandis 
grandis sample was quite large and had a high concentration of DNA, therefore its 
mitogenome sequence was obtained by high-throughput-sequencing. A DNA library was 
constructed with an insert size of 250 bp, and then sequenced with a 150 bp paired end on the 
Illumina platform. De novo assembly was performed by SOAPdenovo-Trans v1.03 (K-mer = 
61 and 71) (Xie et al. 2014), and the assemblies were BLASTed using BLAST+ (Camacho et 
al. 2009). A second method was used with MITObim v1.9 to bait and assemble the 
mitogenome of Macrocheraia grandis grandis directly referring to that of Dysdercus 
cingulatus. 


Analysis and annotation 

The boundaries of tRNA genes and the secondary structures of tRNAs were identified by 
the Mitos WebServer (Bernt et al. 2013). Protein coding genes (PCGs) and the start and stop 
codons were determined by ORF Finder using invertebrate mitochondrial genetic codes which 
were implemented by NCBI website (https://www.ncbi.nlm.nih.gov/orffinder/). The 
boundaries of 12S rRNA and 16S rRNA genes and the secondary structures of those rRNAs 
were determined through comparison with other insects, including Drosophila melanogaster 
(Diptera: Drosophilidae) (Cannone et al. 2002), Chauliops fallax (Hemiptera: Malcidae) (Li et 
al. 2013), and Neolethaeus assamensis (Hemiptera: Rhyparochromidae) (Guo et al. 2017). 
Codon usage and nucleotide composition were analyzed using MEGA v7.0. 


Results 


Genome organization and structure 

The mitogenome of Macrocheraia grandis grandis is a double-stranded circular DNA 
molecule. A nearly complete mitogenome 14,917 bp in length was sequenced without partial 
sequence of the control region (CR). The mitogenome of Macrocheraia grandis grandis 
contains the typical 37 genes, including 22 tRNA genes, 13 PCGs and 2 rRNA genes (Fig. 1, 
Table 2). This mitogenome is relatively compact with 16 gene overlaps. The longest overlap (8 
bp) is located between tRNA-Trp and tRNA-Cys. Another two overlaps between ATPG/ATP8 and 
ND4/NDAL consist of the same seven nucleotides (ATGATAA). In addition, eight intergenic 
spacers with a total of 127 nucleotides were found, ranging in length from 1 bp to 54 bp. 


Entomotaxonomia (2019) 41(2): 96-113 101 


GEE Protein coding genes 
MME RNA genes 

MH rRNA genes 

II Control regions 


Figure 1. Mitochondrial genome maps of Macrocheraia grandis grandis (left) and Myrmoplasta mira (right). 


Table 2. Organization of Macrocheraia grandis grandis mitochondrial genome 


x Length ] Start Stop Intergenic 
Name Position Strand Anticodon 
(bp) codon codon nucleotides 

tRNA-Ile 1-64 J 64 GAT 

tRNA-GIn 62-130 N 69 TTG -3 
tRNA-Met 130-198 J 69 CAT -1 
ND2 199-1179 J 981 ATA TAA 0 
tRNA-Trp 1178-1243 J 66 TCA -2 
tRNA-Cys 1236-1299 N 64 GCA -8 
tRNA-Tyr 1300-1363 N 64 GTA 0 
COI 1365-2903 J 1539 TIG TAA 1 
tRNA-Leu(UAA) 2899-2963 J 65 TAA -5 
COII 2965-3637 J 673 ATG T 1 
tRNA-Lys 3638-3708 J 71 CTT 0 
tRNA-Asp 3709-3771 J 63 GTC 0 
ATPase8 3772-3927 J 156 ATA TAA 0 
ATPase6 3921-4586 J 666 ATG TAA -7 
COIII 4586-5372 J 787 ATG T -1 
tRNA-Gly 5373-5435 J 63 TCC 0 
ND3 5436-5789 J 354 ATA TAG 0 
tRNA-Ala 5788-5849 J 62 TGC -2 
tRNA-Arg 5850-5914 J 65 TCG 0 
tRNA-Asn 5915-5980 J 66 GTT 0 
tRNA-Ser(GCU) 5980-6050 J 71 GCT -1 
tRNA-Glu 6050-6113 J 64 TIC -1 
tRNA-Phe 6112-6178 N 67 GAA -2 
NDS 6182-7873 N 1692 ATT TAA 3 
tRNA-His 7871-7934 N 64 GTG -3 
ND4 7940-9262 N 1323 ATG TAG 5 
ND4L 9256-9546 N 291 ATT TAA -7 
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Continued Table 2 
YA ength Start Stop Intergenic 
Name Position Strand Anticodon 
(bp) codon codon nucleotides 
tRNA-Pro 9577-9638 N 62 TGG 30 
tRNA-Thr 9649-9706 J 58 TGT 10 
ND6 9761-10225 J 465 ATA TAA 54 
CytB 10225-11358 J 1134 ATG TAA -1 
tRNA-Ser(UGA) — 11357-11426 J 70 TGA -2 
ND1 11450-12376 N 927 ATA TAA 23 
tRNA-Leu(UAG) | 12371-12437 N 67 TAG -6 
16S rRNA 12438-13696 N 1259 0 
tRNA-Val 13697-13763 N 67 TAC 0 
12S rRNA 13764-14554 N 791 0 
Control region 14555-14917 363 0 


The mitogenome of Myrmoplasta mira is a double-stranded circular DNA molecule. The 
nearly complete mitogenome 14,806 bp in length was sequenced without the sequence of the 
5’ terminal of 12S rRNA, control region and 5’ terminal of tRNA-Ile. The mitogenome of 
Myrmoplasta mira contains 42 genes, including the typical 37 genes and five more 
duplications of tRNA-Thr (Fig. 1, Table 3). Ten gene overlaps were found. The longest two 
overlaps (7 bp) are same (ATGATAA) located between ATP6/ATP8 and ND4/NDA4L. Twelve 
intergenic spacers with a total of 211 nucleotides were found, ranging in size from 1 bp to 27 
bp. Between the six duplications of tRNA-Thr, the five intergenic sequences were identical 
(CAAACAACCCTA). 


Table 3. Organization of Myrmoplasta mira mitochondrial genome 


m Length ; Start Stop Intergenic 

Name Position Strand Anticodon a 
(bp) codon codon nucleotides 

tRNA-Ile 1-49 J 49 GAT 
tRNA-GIn 61-129 N 69 TIG 11 
tRNA-Met 142-207 J 66 CAT 12 
ND2 208-1191 J 984 ATC TAA 0 
tRNA-Trp 1190-1258 J 69 TCA -2 
tRNA-Cys 1262-1324 N 63 GCA 3 
tRNA-Tyr 1330-1394 N 65 GTA 5 
COI 1403-2941 J 1539 TIG TAA 
tRNA-Leu(UAA) 2937-3002 J 66 TAA -5 
COII 3003-3678 J 676 ATG T 0 
tRNA-Lys 3682-3753 J 72 CTT 
tRNA-Asp 3756-3818 J 63 GTC 2 
ATPase8 3819-3974 J 156 ATT TAA 
ATPase6 3968-4636 J 669 ATG TAA -7 
COIII 4636-5424 J 789 ATG TAA -1 
tRNA-Gly 5434-5497 J 64 TCC 9 
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Continued Table 3 
Name Position Strand Hd Anticodon siart stop EM: 
(bp) codon codon nucleotides 

ND3 5498-5851 J 354 ATA TAA 0 
tRNA-Ala 5866-5931 J 66 TGC 14 
tRNA-Arg 5948-6011 J 64 TCG 16 
tRNA-Asn 6015-6080 J 66 GTT 3 
tRNA-Ser(GCU) 6080-6149 J 70 GCT -1 
tRNA-Glu 6156-6220 J 65 TTC 6 
tRNA-Phe 6222-6288 N 67 GAA 1 
ND5 6286-7996 N 1711 ATT T -3 
tRNA-His 7997-8060 N 64 GTG 0 
ND4 8061-9381 N 1321 ATG T 0 
ND4L 9375-9659 N 285 ATT TAA -7 
tRNA-Pro 9687-9751 N 65 TGG 27 
tRNA-Thr 9766-9827 J 62 TGT 14 
tRNA-Thr 9840-9901 J 62 TGT 12 
tRNA-Thr 9914-9975 J 62 TGT 12 
tRNA-Thr 9988-10049 J 62 TGT 12 
tRNA-Thr 10062-10123 J 62 TGT 12 
tRNA-Thr 10136-10197 J 62 TGT 12 
ND6 10198-10680 J 483 ATA TAA 0 
CytB 10680-11807 J 1128 ATG TAG -1 
tRNA-Ser(UGA) — 11806-11875 J 70 TGA -2 
ND1 11893-12823 N 931 ATA T 17 
tRNA-Leu(UAG) 12818-12883 N 66 TAG -6 
16S rRNA 12884-14148 N 1265 0 
tRNA-Val 14149-14214 N 66 TAC 0 
12S rRNA 14215-14806 N 592 0 


Rearrangements were found in mitogenomes of both Myrmoplasta mira and Macrocheraia 
grandis grandis. The tRNA-Thr and tRNA-Pro switch their positions, and this is the same as in 
Physopelta gutta and Dysdercus cingulatus, the two other species in Pyrrhocoroidea reported by 
Hua et al. (2008). Moreover, in the mitogenome of Myrmoplasta mira, six duplications of 
tRNA-Thr were found located downstream of tRNA-Pro. The gene orders between ND4 and 
CytB of the four sequenced mitogenomes in Pyrrhocoroidea are shown in Fig. 2. 


Clary & Wolstenholme 1985 


[ND4 [ Npat [T [ P [ Npe [ CyB] Drosophila yakuba (Diptera: Drosophilidae) 

Hua er al. 2008 

[ND4 TND4L |P [ T |ND6 | GytB] Physopelta gutta (Hemiptera: Largidae) 

[ND4 [ Npar |P |T [ ND6 | CyB] Dysdercus cingulatus (Hemiptera: Pyrrhocoridae) 
This study 

[NDp4 [ Npat |P |T |ND6 [ CyB] Macrocheraia grandis grandis (Hemiptera: Largidae) 
[NDETNDILTETETEEEEFDEEFTNDEIGHE] toplasa mira (Hemiptera: Pyrrhocoridae) 


Figure 2. The rearrangements and duplications of tRNA-Thr(T) and tRNA-Pro(P) in Pyrrhocoroidea. 
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Transfer RNAs 


The secondary structures of tRNAs in Macrocheraia grandis grandis and Myrmoplasta 
mira are shown in Figs. 3, 4. According to the algorithm of the Mitos WebServer, most tRNAs 
could be folded into the classic cloverleaf secondary structure, except for tRNA-Thr in 
Macrocheraia grandis grandis, in which the TyC stem simply formed a loop. Furthermore, 
among the six duplications of tRNA-Thr in Myrmoplasta mira, the three nucleotides in the 3’ 
terminal are different between the last one (ACU) and the previous five (CUC). The lengths of 
the amino acid acceptor stem, anticodon stem and anticodon loop are conserved. In contrast, the 


DHU arm and TwC arm are more variable in size. 


Ribosomal RNAs 


In the mitogenomes of Macrocheraia grandis grandis and Myrmoplasta mira, the 16S 
rRNA genes are located between tRNA-Leu and tRNA-Val. And the 12S rRNA genes are 
located between tRNA-Val and the control region. The secondary structures of 16S rRNA both 
consist of six domains (domain III absent in Arthropoda) and 43 helices (Figs. S1, S2). The 
secondary structure of 12S rRNA in Macrocheraia grandis grandis contains three domains 
and 28 helices(Fig. S3). The secondary structure of 12S rRNA in Myrmoplasta mira contains 


three domains and 19 helices (Fig. S4). 
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Figure S1. Putative secondary structure of 16S rRNA in Macrocheraia grandis grandis. 
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Figure S3. Putative secondary structure of 12S rRNA in Macrocheraia grandis grandis. 
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Figure S4. Putative secondary structure of 12S rRNA in Myrmoplasta mira. 


Protein coding genes 

In each mitogenome of Macrocheraia grandis grandis and Myrmoplasta mira, all 13 
PCGs were found. Twelve of the 13 PCGs are initiated with start codons ATN, but the COI 
genes are initiated with TTG (Tables 2, 3). Most PCGs terminated with the common stop 
codons TAA or TAG, but COII, ND5, ND4, ND1 in Myrmoplasta mira and COII, COIII in 
Macrocheraia grandis grandis are terminated with a single T. These unusual start codons and 
stop codons have also been observed in many other true bugs, such as Stictopleurus subviridis 
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(Hemiptera: Rhopalidae) (Hua et al. 2009), Dimorphopterus japonicus (Hemiptera: Blissidae) 
(Li 2014) and Lethocerus indicus (Hemiptera: Belostomatidae) (Devi et al. 2016). 
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Figure 3. Putative secondary structures of tRNAs in the Macrocheraia grandis grandis. Typical Waston-Crick 
bonds are illustrated by shot lines; GU bonds by asterisk. 
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Figure 4. Putative secondary structures of tRNAs in the Myrmoplasta mira mitogenomes. The homologous 
nucleotides which were unsequenced in Myrmoplasta mira are shown by a line. Typical Waston-Crick bonds 
are illustrated by shot lines; GU bonds by asterisk. This kind of notation is also suitable for the secondary 
structures of 12S rRNAs and 16S rRNAs described in the supplementary materials. 


Nucleotide composition and codon usage 
For the mitogenomes of Macrocheraia grandis grandis and Myrmoplasta mira, the A+T 
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contents are 75.5% and 75.2% (Table 4). In Macrocheraia grandis grandis, the whole genome, 
rRNA genes, J strand PCGs and tRNA genes, and N strand tRNA genes are AT-skewed and 
CG-skewed. N strand PCGs are TA-skewed and GC-skewed. In Myrmoplasta mira, the whole 
genome, rRNA genes, J strand PCGs, N strand PCGs and tRNA genes are all AT-skewed and 
CG-skewed. J strand tRNA genes are AT-skewed and GC-skewed. 

No significant differences of AT content were observed among the first (71.4% and 76.1%), 
second (77.6% and 70.1%) and third (75.8% and 77.1%) codon positions either in Macrocheraia 
grandis grandis or Myrmoplasta mira. Synonymous codon usage of twenty amino acids is 
shown in Fig. S5. For most amino acids, the most frequently used codons are NNA and NNU, 
which reflects the A/T bias of nucleotide composition. 


Table 4. Nucleotide composition of Macrocheraia grandis grandis and Myrmoplasta mira 


Feature T% C% A% G% Length A+T% AT-skew | GC-skew 
(bp) 
Macrocheraia grandis grandis 
Whole genome 30.1 15.3 45.4 9.2 14917 75.5 0.20 -0.25 
PCGs 41.6 125 334 126 10988 75.0 -0.11 0.00 
PCGs-J 33.3 155 404 108 6755 73.7 0.10 -0.18 
PCGs-N 548 7.7 223 153 4233 77.0 -0.42 0.33 
First codon position 40.55 143 309 143 3663 71.4 -0.13 0.00 
Second codon position 466 111 310 113 3663 77.6 -0.20 0.01 
Third codon position 375 12.1 383 124 3662 75.8 0.01 0.00 
tRNA genes 348 126 43.1 95 1441 77.9 0.11 -0.14 
tRNA genes-J 348 110 43.5 107 917 78.3 0.11 -0.02 
tRNA genes-N 347 155 424 7.4 524 77.1 0.10 -0.35 
rRNA genes 315 145 465 7.6 2050 78.0 0.19 -0.31 
Myrmoplasta mira 
Whole genome 32.55 15.2 42.7 9.6 14806 75.2 0.14 -0.23 
PCGs 31. 157 42.7 9.9 11026 74.4 0.15 -0.23 
PCGs-J 361 156 372 1141 6778 73.3 0.01 -0.17 
PCGs-N 246 159 516 7.9 4248 76.2 0.36 -0.34 
First codon position 28.8 13.8 47.3 10.0 3676 76.1 0.24 -0.16 
Second codon position 31.2 17.9 389 12.1 3675 70.1 0.11 -0.19 
Third codon position 35.1 155 42.0 7.5 3675 77.1 0.09 -0.35 
tRNA genes 35.7 13.2 40.6 10.5 1747 76.3 0.06 -0.11 
tRNA genes-J 359 115 405 124 1222 76.4 0.06 0.03 
tRNA genes-N 352 17.1 408 6.9 525 76.0 0.07 -0.43 


rRNA genes 343 138 443 7.5 1857 78.6 0.13 -0.29 
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Figure S5. Percentage of synonymous codon usage of each amino acid. 


Discussion 


The nearly complete mitogenomes of Macrocheraia grandis grandis and Myrmoplasta 
mira were sequenced in this study. Both of them contain the translocation of tRNA-Thr and 
tRNA-Pro, which is the same as that of the other two mitogenomes in Pyrrhocoroidea 
(Dysdercus cingulatus and Physopelta gutta). In the mitogenome of Myrmoplasta mira, an 
undiscovered gene rearrangement in Heteroptera was found. It contains 27 tRNA genes with 
six duplications of tRNA-Thr, which are located downstream of tRNA-Pro. 

Tandem duplication-random loss is the most commonly supposed model to explain the 
similar phenotypes in insects (Hua et al. 2008; Song et al. 2016; Ye et al. 2016). We assumed 
that in the mitogenome of the recent common ancestor of Pyrrhocoroidea (Fig. 5a), tRNA-Thr 
— tRNA-Pro duplicated to form an intermediate tRNA-Thr — tRNA-Pro — tRNA-Thr — 
tRNA-Pro (Fig. 5b), and then tRNA-Thr at the first set and tRNA-Pro at the last set were 
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randomly lost and form the current arrangement of the mitogenome in Macrocheraia grandis 
grandis (Fig. 5c). In Myrmoplasta mira, the generation of six nearly identical duplications of 
tRNA-Thr indicates that it is a recent event; the duplication of tRNA-Thr may have happen 
after the translocation of tRNA-Thr and tRNA-Pro (Fig. 5d). 


a 


l Duplication 
b 
* * 
l Random loss 
c 
l Duplication in Myrmoplasta mira 


d 


Figure 5. Putative process of rearrangements in the mitogenomes of Pyrrhocoroidea. 


The translocation of tRNA-Thr and tRNA-Pro is according to the available knowledge an 
autapomorphy of Pyrrhocoroidea. While monophyly of the superfamily Pyrrhocoroidea and the 
family Pyrrhocoridae is well supported by several morphological synapomorphies (Henry 1997; 
Hemala et al. a, b) as well as molecular evidence (e.g., Sudakaran et al. 2015; Gordon et al. 
2016). However, the monophyly of the family Largidae, consisting of two subfamilies, the New 
World Larginae and Old World Physopeltinae, is morphologically supported only by 
symplesiomorhic characters (Stehlík 2013; Hemala et al. a,b) and also its molecular support is 
also not significant (Sudakaran et al. 2015; Gordon et al. 2016). The unique translocation of 
tRNA-Thr and tRNA-Pro thus confirm close relationship of Physopeltinae and Pyrrhocoridae, 
but obtaining a mitogenome of the subfamily Larginae is badly needed to generalize the situation 
in Pyrrhocoroidea and to evaluate the importance of the discovered translocations. Other 
synapomorphies and autapomorphies of groups in Heteroptera may be hidden by limited 
information insofar as only one or two mitogenomes have been sequenced in more than half of 
the families. More mitogenomes need to be explored to acquire a more comprehensive view on 
the evolution of mitogenomes in Heteroptera. 
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